READSUM: Retrieval-Augmented Adaptive Transformer for Source Code Summarization

نویسندگان

چکیده

Code summarization is the process of automatically generating brief and informative summaries source code to aid in software comprehension maintenance. In this paper, we propose a novel model called READSUM, REtrieval-augmented ADaptive transformer for SUMmarization, that combines both abstractive extractive approaches. Our proposed generates an manner, taking into account structural sequential information input code, while also utilizing approach leverages retrieved summary similar increase frequency important keywords. To effectively blend original at embedding layer stage, obtain augmented representation through multi-head self-attention. addition, develop self-attention network adaptively learns representations encoder stage. Furthermore, design fusion capture relation between decoder The guides generation based on summary. Finally, READSUM extracts keywords using high-quality considers code. We demonstrate superiority various experiments ablation study. Additionally, perform human evaluation assess quality generated

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Preprocessing of Object-Oriented Source Code for Code Retrieval

Object oriented source code occurs in diverse programming languages with documentation using miscellaneous standards, comments in individual styles, or associated test cases that are hard to exploit through information retrieval or knowledge discovery techniques. Typically, the information about object-oriented source code for a software system is distributed across several different sources, w...

متن کامل

A Convolutional Attention Network for Extreme Summarization of Source Code

Attention mechanisms in neural networks have proved useful for problems in which the input and output do not have fixed dimension. Often there exist features that are locally translation invariant and would be valuable for directing the model’s attention, but previous attentional architectures are not constructed to learn such features specifically. We introduce an attentional neural network th...

متن کامل

Source Code Retrieval using Conceptual Similarity

We propose a method for retrieving segments of source code from a large repository. The method is based on conceptual modeling of the code, combining information extracted from the structure of the code and standard informationdistance measures. Our results show an improvement over traditional retrieval models, indicating that, for this type of highly-structured documents, usage of structure is...

متن کامل

Source Code Retrieval Using Sequence Based Similarity

Duplicate code adversely affects the quality of software systems and hence should be detected. We discuss an approach that improves source code retrieval using structural information of source code. A lexical parser is developed to extract control statements and method identifiers from Java programs. We propose a similarity measure that is defined by the ratio of the number of sequential fully ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3271992